Shallow2Deep: Indoor scene modeling by single image understanding
Authors
Abstract
Similar resources
Support surfaces prediction for indoor scene understanding
In this paper, we present an approach to predict the extent and height of supporting surfaces such as tables, chairs, and cabinet tops from a single RGBD image. We define support surfaces to be horizontal, planar surfaces that can physically support objects and humans. Given an RGBD image, our goal is to localize the height and full extent of such surfaces in 3D space. To achieve this, we create...
3D Scene Modeling and Understanding from Image Sequences
Reconstructing and representing large-scale 3D scenes from image sequences has many important applications, including airborne or ground video surveillance for moving target extraction, automated 3D urban scene construction, airborne/ground traffic survey, and image-based modelling and rendering. To deal with two major challenges in large-scale 3D modelling, huge amounts of data and intensive com...
Model-driven indoor scenes modeling from a single image
In this paper, we present a new approach to 3D indoor scene modeling from a single image. With a single input indoor image (including sofa, tea table, etc.), a 3D scene can be reconstructed using an existing model library in two stages: image analysis and model retrieval. In the image analysis stage, we obtain the object information from the input image using geometric reasoning technology combined with ...
Image Segmentation and Scene Understanding Project
1. Introduction. Scene or image understanding deals with the problem of making a computer "understand" the world behind the image. This can be done in a number of different ways. In this project, we deal with one kind of scene understanding problem: semantic image segmentation, or pixel labeling. Multi-class image segmentation or pixel labeling does more than the task of object recognitio...
Joint 2D-3D-Semantic Data for Indoor Scene Understanding
We present a dataset of large-scale indoor spaces that provides a variety of mutually registered modalities from 2D, 2.5D and 3D domains, with instance-level semantic and geometric annotations. The dataset covers over 6,000 m² and contains over 70,000 RGB images, along with the corresponding depths, surface normals, semantic annotations, global XYZ images (all in forms of both regular and 360° e...
Journal
Journal title: Pattern Recognition
Year: 2020
ISSN: 0031-3203
DOI: 10.1016/j.patcog.2020.107271